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O, 

f^ ^ An asymmetric information model is introduced for the situation in which there is 

O^ ' a small agent who is more susceptible to the flow of information in the market than 

^ I the general market participant, and who tries to implement strategies based on the 

O I additional information. In this model market participants have access to a stream 

of noisy information concerning the future return of an asset, whereas the informed 
trader has access to a further information source which is obscured by an additional 
noise that may be correlated with the market noise. The informed trader uses 
the extraneous information source to seek statistical arbitrage opportunities, while 
at the same time accommodating the additional risk. The amount of information 
available to the general market participant concerning the asset return is measured 

r^ ' by the mutual information of the asset price and the associated cash flow. The worth 

r^ . of the additional information source is then measured in terms of the difference 

f p^ I of mutual information between the general market participant and the informed 

trader. This difference is shown to be nonnegative when the signal-to-noise ratio 
of the information flow is known in advance. Explicit trading strategies leading to 
statistical arbitrage opportunities, taking advantage of the additional information, 

f^ ' are constructed, illustrating how excess information can be translated into profit. 

/»f\ ' Keywords: statistical arbitrage; asymmetric information; information-based 

jys ' asset pricing; hedge funds; insider trading; mutual information 

(N' 

' ■ 

1. Introduction 
!>■ 

^D \ There are many different approaches to the modelling of so-called insider trad- 

ing strategies. Starting with the work of Kyle (1985) and Back (1992), a number 
of investigations have been carried out (to name a few, Amendinger et al. 1998, 
Seyhun 1988, FoUmer et al. 1999, Back et al. 2000, Leon et al. 2003, Corcuera et 

k> ; al. 2004, Biagini & 0ksendal 2005, 2006, Ankirchner et al. 2006, Campi & getin 

\^ • 2007). It is sometimes assumed in the literature that the 'insider' has direct access 

P^ \ to the values of future asset prices. While such a scenario may indeed occasionally 

prevail, the more common situation is that informed agents do not have advance 
access to the exact values of future asset prices. How, then, do agents having high 
information susceptibility — when compared to the average market participant — 
utilise their strengths in reality? An increasingly popular strategy adopted by some 
large hedge funds is to make use of publicly available information in addition to 
high-frequency price data. What gives these funds an edge is their significant com- 
putational power for data and text mining, thus allowing them to extract useful 
information from publicly available sources faster than their competitors. Against 
this background it is natural to ask how much added information an extra source 
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provides, how does it affect trading strategies, and more generally in what way can 
such information-based strategies be modelled mathematically. 

The purpose of the present paper is: (i) to introduce a phenomenological ap- 
proach to model the agent susceptive to information, (ii) to quantify the amount 
of added information, and (iii) to derive trading strategies that lead to statisti- 
cal arbitrage opportunities for such an agent. Our analysis is carried out within 
the information-based asset pricing framework of Brody, Hughston and Macrina 
(Macrina 2006; Brody et al. 2007, 2008a,b; Rutkowski & Yu 2007; Hughston & 
Macrina 2008). In this framework — hereafter referred to as the BHM framework — 
the price process of an asset is derived from the specification of (a) future cash flows 
associated with the asset, and (b) the flow of information accessible to market par- 
ticipants. The price is then given by the discounted risk-adjusted expectation of the 
cash flows conditional on the available information. 

The simplest model that arises in the BHM framework is briefly reviewed in §2. 
In this setup the asset is characterised by a contract that delivers a single random 
cash flow at a predesignated time. Such an asset can be interpreted as having 
the structure of a credit-risky discount bond. In §3 we consider the problem of 
quantifying the amount of information contained in the bond price concerning the 
value of the future bond payout. To this end we determine the mutual information 
(Yaglom & Yaglom 1983, Cover & Thomas 1999) of these two random variables. 
Initially the market has no information, beyond that already implicit in the asset 
price, concerning the value of the cash flow. However, as time goes by, the market 
gathers information. When the amount of information reaches the level of the initial 
entropy of the cash flow, the market flnally 'learns' what the value of the cash flow 
is. The information-theoretic analysis is extended in §4 where we show that the 
mutual information at time t is given by the initial uncertainty, less the expected 
uncertainty that remains at that time. 

In §5 a simple model for an informed trader is introduced. In our approach the 
informed trader is more susceptive to the flow of market information than other 
market participants, and thus on average is able to estimate the value of the im- 
pending cash flow more quickly and accurately than the other market participants. 
Simulation studies show a comparison of the sample paths for the market price 
process and the corresponding valuations made by the informed trader, revealing 
various intuitive as well as counterintuitive properties of these processes. The dy- 
namics of the valuations estimated by the informed trader are worked out in §6, 
where we obtain the associated innovations representation. With a basic model for 
an informed trader at hand we are able to quantify the amount of added information 
held by the trader. This is worked out in §7, where we construct an elementary trad- 
ing strategy making use of the additional information, demonstrating the existence 
of statistical arbitrage opportunities in such circumstances. 

2. Information and asset pricing 

The simplest model arising in the BHM framework can be summarised thus. We fix 
a probability space (fi,^, Q), where Q denotes the risk-neutral measure. We write 
E for expectation with respect to Q. The market is not assumed complete, but we 
assume the absence of arbitrage and the existence of an established pricing kernel — 
these assumptions ensure the existence of a preferred pricing measure (cf. Cochrane 
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2005). We let Xt denote the random cash flow associated with the asset and paid 
at time T (for example, the payout of a credit-risky discount bond). Before time 
T market participants do not have direct access to the value of the cash flow. We 
assume nevertheless that partial information concerning the value of X^ , obscured 
by the market noise, can be obtained before time T. This noisy 'observation' of 
Xt generates the market filtration {Tt}, and the price at time t is given by the 
risk- neutral expectation of the discounted cash flow, conditional on Tt ■ 

We assume that the 'signal' of the noisy observation concerning Xt is revealed 
to the market at a constant rate a, that the 'noise' is generated by an independent 
Brownian bridge process {Ptr}, and that the market filtration is generated by an 
information process {^t} defined for < t < T by 



^t^crtXT + f3tT- 



(2.1) 



In other words, {Tt} — o'{{S,s}o<s<t)- The use of a bridge process for the noise 
is motivated by the idea that at time all the available information about Xt is 
incorporated into the a priori distribution, and that at time T the value of Xt is 
revealed and there is no remaining noise: the choice of a Brownian bridge is made 
for simplicity and tractability. If we further assume that the default-free system of 
interest rates is deterministic and let {Ptr} denote the price at time t of a discount 
bond that matures at T, then the price of the credit-risky discount bond at time 
t is given by Bit = PtT^[XT\^t]- In the case where Xt takes the discrete values 
{xi}i^i „ with the a priori probabilities {pi}i=i....^n, a calculation shows that 



BtT = Pi 



tT 



ELiK2;iexp 


5^ {ax^^t - \<J^xlt) 




Ei=iPiexp 


T-t {(^Xi^t Icr'^xft)^ 





(2.2) 



This follows from the fact that the conditional risk-neutral probability defined by 

T^it = Q{Xt — Xi\Tt) takes the form 



PiCxp 



TTjt 



T-t 



{(TxS - ^cr'^x^t) 



I]"=iPiexp ^{crxi^t - ^cr^xft) 



(2.3) 



By taking the stochastic differential of (2.2) one finds that the dynamical equation 
satisfied by the bond price is 



dBtT = rtBtTdt- 



T-t 



PtTVtTdWt: 



(2.4) 



where rt = —din Pot /dt, and where the process {Wt} defined by the expression 



Wt=^t 



1 



T-s 



(_s ds - gT 



1 



T - 3 



■Xsdi 



(2.5) 



turns out to be a standard {J^f}-Brownian motion. Here Xt — X]i=i •^i"'** denotes 
the conditional expectation of the cash flow XT^ and 



VtT^Y. 



Xi - Xt] IT. 



(2.6) 
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is the conditional variance of Xt (see Macrina 2006 and Brody et al. 2007 for 
derivations of the foregoing resuhs). 

We observe that in the information-based framework it is possible to deduce 
the diffusive dynamics (2.4) for the price process, starting from the specification of 
the cash flow Xt and the information process {$*}. The theme that underlies this 
framework is that the market acts as a 'signal processor' for future cash flows so 
as to generate the dynamics of asset prices. This point of view is natural as a basis 
for understanding the elements of price formation, since investment decisions are 
often made in accordance with the perceptions of market participants concerning 
the future cash flows associated with the given asset. 

As far as the market filtration is concerned, the information contained in {^4} is 
equivalent to that in {Btr}'- we have cf({^s}o<s<t) ~ <^{{BsT}i)<s<t)- This follows 
from the fact that one can write Bit = B{t,£^t), where 



E" T (' 12 2-l\ 

i=i PiXi exp ^—i [<yxiX ~ 2"^t^) 



B{t,x)^PtT- 

Y.7=i Pi e'^P 7?^ {o-XiX - jcr'^xjt) 

from which by differentiation wc deduce that 



(2.7) 



B'{t,x) 



crTPfT ^«^=i P^ [^* ^ -^(^' ^y^t^f exp ^ (aXiX - ^(j'^xft 



T^t 



J27=i Pi <3xp j^ (axix - ^a-^xft) 



(2.8) 



which is positive. Therefore, B{t,x) is monotonically increasing in x, and hence 
invertible. It follows that from knowledge of the trajectory {^s}o<s<t one can con- 
struct {BsT}o<s<t', conversely from knowledge of the trajectory {BsT}o<s<t one 
can construct {Cs}o<s<t- 



3. Amount of information about the future cash flow 
contained in the price process 

We would like to quantify how much information regarding the value of the cash flow 
Xt is contained in the value at time t of the information process (2.1). A reason- 
able measure for such quantification is given by the mutual information J(^i, AT^) 
between the two random variables (Shannon & Weaver 1949; Gel'fand et al. 1956; 
Gel'fand & Yaglom 1957), which in the present context is given by the expression 



^ /.OO 

J(6,At) =X! / Pixi^^i) In 



PA^)Pxii) 



dx. 



where 



Pix{x,i) = 



dx 



(6 < a;) n (At = x,) 



(3.1) 



(3.2) 



is the joint density function of the random variables (^t, A^), and p^ (x) and p^ (i) 
are the respective marginal probabilities. By use of the relation 



iCt <x)r] {Xt - X,) 



't<x\XT = x,)Q{XT^x,) 



(3.3) 
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we deduce that 

Pixi^^^J^Pi , ,,^ .x,^ exp -g— -T77T7 , (3.4) 



V27rt(T-i)/T'''n U{T-t)/T)' 



since conditional on X^ = Xi the random variable ^t is normally distributed with 
mean axit and variance i(r — t)/T. From (3.4) the marginal densities 

pA^) ^^Piixix,i) and p^ii) = l Pix{x,i)dx (3.5) 

i=i ■'-°° 

can be deduced at once. In particular, we have p-^{i) — Pi, as it should be. 

An alternative way of deriving the mutual information in this context is to make 
use of the identity 

J{^t,XT)^Hi^t)-Hi^t\XT). (3.6) 

Here H{£_t) = ^IE[lnpj(^t)] is the entropy of the random variable £,t (Wiener 1948, 
Khintchine 1953), and H{(_t\XT) ~ —E[ln p^{^t\XT)] is the entropy of ^t conditional 
on Xt- The conditional density function p (x\Xt), a; G R, is defined by p. {x\Xt) = 
<iQ{£,t < x\XT)/dx. Since conditional on Xt the random variables £,t and f3tT are 
both normally distributed, with the same variance t(T — t)/T, and since the entropy 
of a normally distributed random variable is independent of its mean, we find that 
-ff(^t|Xr) = H((3tT)- In other words, the mutual information in the present context 
is given by the difference of the two entropies: 

Ji^t,XT)^H{^t)-HiPtT). (3.7) 

As a consequence, the information about the cash flow Xt contained in ^t can be 
determined (a different approach to extracting information concerning the asymp- 
totic dividend stream from option price data is considered in Geman et al. 2007). 

From an information-theoretic point of view a pair of processes related through 
an invertible smooth function, and thus sharing the same filtration, in general 
possess different information content (entropy). On the other hand, since what is 
directly observed in the market is the price BtT, which is an invertible function of 
^t, one might argue that it is more relevant to determine the mutual information 
J{BtT, Xt), that is, the amount of information about the future cash flow contained 
in the market price. However, since mutual information is given by a difference 
of entropies, and since changes in the two entropies resulting from the transform 
cancel, we have J{BtT,XT) = J{£,t,XT)- Therefore, the amount of information 
about Xt contained in BtT is given by (3.7). 

In Figure 1 we plot the mutual information J{BtT, Xt) as a function of t S [0, T] 
for three values of the information flow-rate parameter a. The information gained 
by market participants increases more rapidly as a is raised. On the other hand, the 
dynamical relation (2.4) shows that the value of a determines the overall magnitude 
of the price volatility. Thus it is possible to quantify the market information gain 
and compare this with the price volatility. 

To see how entropy transforms under a nonlinear invertible map, suppose that 
X is a random variable with density p{x), and that Y is another random vari- 
able given by y = f{X), where f{x) is smooth and invertible. Then the den- 
sity function for Y is given by q(y) = p{f^^{y))/f'{f^^{y))- Substituting this in 
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Figure 1. Mutual information J concerning the value of the future cash flow Xt contained 
in the price BtT- Early observation of the price yields little information concerning the 
value of Xt- As time progresses the market extracts more information about Xt\ eventu- 
ally, close to maturity when the value of Xt is to be revealed the amount of information 
gained by the market reaches — J^^p^lnpi. The parameters are chosen to be h\ = 0, 
/i2 = 0.5, hs, = 1, pi — 0.1, p2 = 0.15, p3 ~ 0.75, and T = 5. The three plots correspond 
to (T = 0.75 (blue), a = 0.5 (green), and a — 0.25 (red). The terminal value of J in these 
examples is given approximately by 0.73. 

H(Y) — ^ J q{y) In q{y)dy and changing variables by setting y — f{x), we find that 
H{Y) — H{X) + J p{x) ln/'(a;)da;. As a consequence, we have 



H{B, 



tT 



Hi^t) 



p^{x) hiB'{t, x) dx, 



(3.8) 



where B'{t, x) = dB{t, x)/dx. The advantage of this expression is that we need not 
determine the inverse of the function B{t, x) defined in (2.7) in order to calculate 
H{BtT)- From (2.8) it follows that 



p^{x)\nB'{t,x)dx^^ 



In 



T-t 



PtrVfT 



(3.9) 



In other words, the difference between H^Btr) and H{^t) is the average log-volatility 
of the price process at time t. 

4. Analysis of information measures 

We proceed in this section to consider the Shannon- Wiener entropy associated with 
the conditional risk-neutral probabilities. Analysis of the entropy leads to insights 
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into the qualitative behaviour of the asset price volatility. The Shannon- Wiener 
entropy is defined by the expression 



fft = - ^ TTit In TTi 



(4.1) 



We shall demonstrate that the mutual information (3.1) and the Shannon- Wiener 
entropy (4.1) are related as follows: 



J{^t,XT) = Ho-E[Ht]. 



(4.2) 



Thus, the mutual information at time t is given by the initial uncertainty, less the 
expected uncertainty that still remains at that time. 

The derivation of (4.2) proceeds in two steps. First we shall show that 



E[Ht]=Ho-^I 
and then we shall show that 



2rp2 



a'T 



{T-sy- 



VsT<^S 



2rp2 



a'T 



(T 



VsT^S 



(4.3) 



(4.4) 



Proof of (4-3). Let us begin by deriving the dynamical equation satisfied by 
the Shannon-Wiener entropy. From (2.3) and (2.5) we find that the conditional 
probability satisfies 



diTit = 



aT 
T-t 



(xi - Xt)TritdWt. 



(4.5) 



It follows, by an application of Ito's lemma to (4.1), that 



Ht — Ho 



2rp2 



a'T 



{T-s)- 



Vstds 



aT 



T 






XsWAm:^s]dWs. (4.6) 



Taking the expectation of both sides of (4.6), we obtain (4.3), as desired. D 

Proof of (4-4)- In Gel'fand & Yaglom (1957) it is shown that the mutual infor- 
mation can be expressed as the expectation of the log density of the joint measure 
/i^x with respect to the product measure fi^*^^: 



J{^uXt) 



In- 



d^ 



d{^l^*^l^ 



(4.7) 



We are thus required to determine the relevant Radon-Nikodym derivative. We shall 
follow the line of argument presented in Davis (1978) (see, also, Duncan 1970). To 
proceed, we require the introduction of an auxiliary measure B introduced in Mac- 
rina (2006) and Brody et al. (2007, 2008a). This is the so-called bridge measure un- 
der which the information process {S,t\ becomes a Brownian bridge. The argument 
goes as follows. We fix the probability space (fi, JT, Q) with a filtration {Ht}o<t<oo, 
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and introduce a Q-Brownian motion {Bt} such that the Brownian bridge {/Str} 
appearing in (2.1) is given by 

(3tT = {T-t) f -^dB.,. (4.8) 

Jo T-s 

This is the standard integral representation for a Brownian bridge (see, e.g., Hida 
1980, Protter 2005). Setting i^t = (tT/{T - t) wc define 



Kt^ = exp ( - / VsXtABs -\ I vlx'^As 



(4.9) 



where Xt is Q-independent of {Bt} and is Tio-measurable. For fixed u < T we 
introduce the measure B on Tiu by writing 

dB = A.;;MQ. (4.10) 

Then the process {H^t*}o<t<u defined by 

W; = f UsXr^s + Bt (4.11) 

JO 

is a B-Brownian motion, since VsXt is bounded for any s < u. 

Under B we find that the distribution of Xt is same as it is under Q, that {^t} 
and Xt are independent, and that {^j} is a B-Brownian bridge (cf. Brody et al. 
2008a). To see these, recall that since Xt and {Pit}, and hence Xt and {Bt}, 
are independent, the probability law of {Bt} conditional on Xt remains that of a 
Brownian motion. Now take a bounded function f{x) and consider 

E» [/(Xt)] = E [/(Xt)E[A-1|Xt]] . (4.12) 

Conditional on Xt, X^^ takes the form of a Girsanov exponential, since {Bt} is a 
Q-Brownian motion. Therefore, the inner expectation equals unity, and we find 

E" [/(Xt)] = E [/(Xt)] (4.13) 

for every bounded function f{x). In other words, Xt has the same probability law 
under Q and B. In a similar manner, for any sequence of times ti, . . . , t„ G [0, u] 
and any bounded function g : M" ^ R we wish to calculate 

E"L/(^t)5(6,,--- ,6J]=Ie[/(^t)E[5(6,,--- ,6JA-^|Xt]]. (4.14) 

By the same argument as above, for each Xt the process {VFj*|xr}o<t<tt is Brown- 
ian under the measure whose density is A^^jx^- Since {VFf*}o<i<M itself is Brownian 
under B we have 

E [5(6,, • • • ,6JA-1|Xt] = E» [g(6,, • • • ,6J] , (4.15) 

and hence 

E" [,/(^t)5(6: , • • • , 6 J] = E [/(Xt)] E» [g(6, , • • • , 6J] • (4.16) 

Working paper. 



Informed Traders 



However, B and Q coincide on Xt so that E^lfiXr)] = E[/(Xt)], from which it 
foUows that Xt and {^t} are B-independent. By combining (4.8) and (4.11) we get 

dW; = t^tXrdt + jr—^ Ptrdt + Afitx- (4.17) 

Ehminating j3tT by use of [3tT — £,t ~ criXx we obtain the relation 

d6 = -^6dt + dT4^;, (4.18) 

which is the dynamical equation satisfied by a Brownian bridge in the B measure. 
Let "$ be the map: w -^ {{^t{i^)}o<t<u, Xt{uj)} . Then the joint sample space 
measure of {{^t(w)},XT(w)} is fj,^-^{ A) ~ Q{'i'^^{A)) for any measurable set A, and 
the sample space measure of Xt is ii^{A') = Q{Xj^^{A')) for any measurable set 
A'. However, since {£,t} and Xt are independent under B, and {^t} is a B-Brownian 
bridge, we have /x^*/x^(A) = B(*"i(yl)). It follows from (4.9) that 

^=cxp(f i,,XTdB, + ^f i^^^X^dsY (4.19) 

Substituting (4.11) in here we thus deduce that 

^^^sx _ „^^ / / ,, Y^AUr* _ 1 / ,,2^2 



d(Mx*M« 



exp ( f ly.XTdW; - i / ly'^.X^dsj . (4.20) 



Turning to the innovations representation (2.5) we find, along with (4.18), that 
{Wt} and {VFj*} are related according to 

dW; = vtXtdt + dWt. (4.21) 

Thus, following a similar line of argument we obtain 

d^ 



« 



d/i 



/3 



= exp / VsXsdW; - \ I vlX^ds , (4.22 



which is a version of the likelihood ratio formula of Kailath (1971). The measure 
/^j thus corresponds to the 'signal present' situation, while the 'signal absent' case 
corresponds to {^t} being pure bridge noise with measure /i^. Combining (4.20) 
and (4.22), and making use of (4.11), we deduce 



d/x 



d(M£*Mx) 



cxpH Vs{XT-X,)dB, + \ j uliXT-Xsfd^. (4.23) 



Taking the expectation of the logarithm of this, bearing in mind that {Bt} is a 
Q-Brownian motion, we recover (4.4), as claimed. D 

The entropy process {-fft}o<t<T has the property that linit^T Ht = 0. This 
follows from the fact that the conditional probability process {T^it\Q<t<T has the 
limiting behaviour 

lim 7rrf(w) = t{XT{uj) = Xi} (4.24) 
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for i ~ 1, . . . ,n. The proof of (4.24) is as follows. Let w £ $7 be fixed, and suppose 
that Xt{i^) = Xk for some k. For this realisation of uj the information process is 
given by £,t = crtx^ + f3tT- Substituting this expression for ^t into (2.3) and dividing 
the denominator and the numerator by the exponential factor appearing in the 
numerator, we deduce that 



Pk 

T^kt = 



Pk+ Y. P] exp j^ {(j{xj -~ Xk)(3tT - ^(^^t{xj - XkY) 

3^k L 



(4.25) 



Observe that all of the terms in the sum in the denominator vanish as t approaches 
T, and therefore limj__,7^ ttj-j — 1. It follows that lirat^T ''^u = for i ^ k, and thus 
(4.24). Finally, since Ht = -Innr^i^^It" by (4.1) and since liiat^TT^u' ^ 1' ^^ 
deduce that limj^T Ht = 0. 

If we let t approach T in (4.3) we find the following relation: 



^0 = ^1 



T ^2j^ 



VsTds 



(4.26) 



Since Hq is bounded by In n, where n is the number of values Xt can take, and 
since the coefficient of the conditional variance {14t} in the integrand diverges 
quadratically as s approaches T, this relation shows that the variance process has 
to decay sufficiently rapidly to ensure the existence of the right side of (4.26). On 
the other hand, the conditional variance also generates the random movement of 
the asset price volatility in (2.4). As a consequence we are able to obtain a crude 
estimate of the magnitude of the cumulative volatility. It is worth noting that in 
models based on Brownian noise the entropy and mutual information are closely 
related to prices of variance or volatility derivatives. A related observation has been 
made by Soklakov (2008). 

It should be remarked that the limiting behaviour Ihnt^T Ht = is specific to 
the case in which Xt takes discrete values. If Xt is a continuous random variable, 
then the associated entropy has the property that limt^T Ht = — oo, which can be 
seen from the Hirschman inequality in Fourier analysis (Beckner 1975): 

Ht<^il + H2n)) + ^\nVtT. (4.27) 

It follows from (4.6) that the variance process {V^st} in this case does not vanish 
sufficiently rapidly to ensure finiteness of the right side of (4.3) as t approaches T. 
In other words, there is a qualitative difference in the behaviour of the volatility 
process, depending on whether the cash flow is a continuous or discrete random 
variable. In particular, volatility products may be overpriced in models based on 
continuous cash flows, since real market cash flows are not continuous. 

5. A model for an informed trader 

In the previous sections we have examined the BHM framework from an information 
theoretic perspective. In particular, we have been able to quantify the amount of 
information about the future cash flow of an asset contained in its price. We turn 
now to consider a model for an informed trader who has access to an additional 
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information source, apart from the price process itself, concerning the future return 
of an asset. We assume that the informed trader is 'small' in the sense that access to 
the additional information is limited, and that the actions of the informed trader will 
not impact the price process. In other words, the model is not for a large number of 
small agents; rather, it is for a single agent, or a highly restricted number of agents, 
who carefully execute their trading strategies, taking advantage of the additional 
information. 

One might expect that the use of additional information gives a definite advan- 
tage for the trader. This, however, is not necessarily the case: additional information 
is in general obscured by additional noise. As a consequence, the valuation process 
estimated by the informed trader can entail higher volatility than the actual market 
price movements. It follows that any strategy making use of additional information 
will tend to embody additional risk. Nevertheless, on average such strategies are ex- 
pected to outperform the market, and this is the idea behind some of the statistical 
arbitrage strategies adopted by hedge funds. 

The BHM framework is sufficiently flexible to model this kind of scenario. In- 
deed, the use of this framework as a basis for the development of insider-trading 
models was recognised early on (Macrina 2006). Our intention here is to apply 
such ideas to describe the disparity in the ability of processing publicly available 
information, and to illustrate how statistical arbitrage opportunities can be seen 
to arise in a simple model. It should be emphasised that many of the simplifying 
assumptions — that the asset entails a single cash flow; that the information flow 
rates are constants; that the interest rate is deterministic; and that the noises are 
modelled by Brownian bridges — can be relaxed without affecting the main qualita- 
tive features of the model. 

We assume the setup for the market outlined in §2. However, in addition there 
is an informed trader who has a further noisy information source represented by 
the information process 

^'t ^ a'tXT + P'tT- (5.1) 

The noise term {(i'^x} niay or may not be correlated with the market noise {(itx}- 
We let {Bt} and {B'^} be a pair of Brownian motions with correlation p, and define 
the associated Brownian bridges by 

PtT^Bt--^BT, and 0'^^ ^ B[ - ^ B'^ . (5.2) 

In this way we can model the two noise terms with fair amount of generality (we 
may use alternatively the integral representation (4.8) to construct the bridge pro- 
cesses; however, the choice (5.2) is more suitable for simulation purposes), since p 
determines the correlation of {Ptr} and {/3'fj^}. In particular, if \p\ = 1 then the 
informed trader has two linear equations (2.1) and (5.1) for the two unknowns 
X-T and /3tT', hence the value of the future cash flow X-r will become instantly 
accessible to the trader (assuming \a\ ^ |(j'|). This situation corresponds to the 
fully-informed conventional 'insider' often considered in the literature. The other 
extreme, for which \p\ ^ 1, is of interest, since the informed trader must choose a 
strategy optimally so as not to be overwhelmed by the additional noise. 

We let {J-i} denote the flltration generated by {^(}. If a' > a, then knowledge of 
the value of Xt is revealed at a faster rate in the 'primed' filtration. This, however. 
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Figure 2. Sample paths comparison for a defaultable digital bond. Each plot contains a 
pair of sample paths: one for which the bond does not default and one for which the 
bond defaults. The two plots on the left show the market prices with a — 0.25 (top) and 
a = 0.7 (bottom). Market prices can be compared with the valuations estimated by the 
informed trader, represented by the two plots on the right. The parameters are chosen 
to be a' = 0.4, p = 0.1 (top) and a' = 0.01, p = 0.9 (bottom). In all plots, the other 
parameters are set to be hi = 0, /12 = 1, po = 0.2, pi = 0.8, r = 0.05, and T = 1. 

does not mean that {Tt} is contained in {Tj.} even li p = 1; the two filtrations are 
merely inequivalent. On the other hand, since the informed trader also has access 
to the price process, which is adapted to {^t}, it is reasonable to assume that 
the information source is given by {Gt} = o'({Cs,Cs}o<s<t)- We assume that the 
additional information commences at time t = 0; hence the a priori probabilities 
{pi} for Xt to take the values {xi} remain the same. This assumption may seem 
limiting; however, it is not unreasonable since the 'lifetime' of the extra information, 
i.e. the period over which extra information has value, is often short in practice. 

The informed trader will use the extra information to work out the price that the 
market would have made had {Gt} been accessible to general market participants. 
We shall now calculate the valuation process made by the informed trader on this 
basis. From the Markov property of the joint information process {£,t,£,i} '^^ ^^d. 
that the informed valuation process is given by 



BtT = Ptr'^XiTrit, 



(5.3) 
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time (yrs) 



Figure 3. Comparison between averaged market prices and informed valuations. In both 
plots, the solid lines represent averaged market prices of a defaultable digital bond; one 
which default does not occur and one which the bond defaulted. The parameters are chosen 
so that Xt takes values or 1 with a priori default probability 20%, r = 5%, T = 1, and 
the market signal-to-noise ratio is cr = 0.2. These are compared with the corresponding 
averages for the informed trader (dashed lines), with two sets of parameters: a' = 0.4 and 
p = 0.1 (left); and cr' = and p = 0.95 (right). 



where tt,-, 



T — Xi\^t,^i)- By use of the Bayes formula 



TTrf 



PiPi^t^i'tl^T ^ Xi) 



where the conditional density is given by the bivariate normal density function 



(5.4) 



P{it-,^'t\XT ^x{) = 



27ri(T-i)^l-p^ 



cxp 



T 



2(1 -p2) 



{it-(rx,tf 2p{^t-(TX,t){Ct-a'xa) , iCt-^'x^tf 



t{T - t) 



t{T - t) 



t{T-t) 



(5.5) 



we deduce that 



BtT — PtT- 



ELi P^^i exp J^T-i^ {xii^^i^t + ^2^0 - jcrfxft) 



En 
^=l P^ exp 



e(T-t) 



{xi{aie.t + (T2e.'t) - \(Tixjt) 



(5.6) 



Here we set ui = a — pa' , ai = a' — pa, a"^ = a"^ — 2paa' + a'"^ , and g = 1 — p^ 
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The process (5.6) is our model for the valuations made by the informed trader. It 
is straightforward to simulate the informed valuation process {Btx} and compare 
this against the uninformed market price process {Btx}- By suitably adjusting 
the values for a, a', and p we are able to confirm various intuitive aspects of 
the behaviour of these processes; some examples are displayed in Figure 2. With 
respect to any given sample path the expected price of the informed trader at time 
t S [0, T] may be less accurate (by 'accurate' we mean close to the true value) than 
the market price. However, on average the valuation determined by the informed 
trader converges more rapidly to the true value of the bond. This is illustrated in 
Figure 3 where we plot the averaged sample paths conditional on the given outcome. 

The plot on the right side of Figure 3 indicates that the performance of the 
informed trader is high even if the signal-to-noise ratio a' of the additional infor- 
mation source is set to zero (and hence {^j} is pure noise). In fact the quality of 
the estimate decreases as the value of a' is raised from zero, until it reaches the 
critical level a' = pa. Putting the matter differently, the quality of the estimate 
m,ade by an informed trader is not monotonic in the signal-to-noise ratio of the 
additional information source. This feature may appear counterintuitive, but it can 
be understood by rearrangement of terms in (5.6) into a form analogous to (4.5): 



B, 



tT 



PtT 



PkXk + J2 PiXi exp ^(^zty ((1 - p)(o' + cr')^ifc/3tT - ^crlujf^t) 



Pk+Y.Pi exp ^(^Zjy ((1 - p)(cr + G')uJ^kPtT - \<^l^1kt) 



(5.7) 



Here we write lOik ~ Xi — Xk. This expression shows that the exponential rate 
of convergence for the process {Bit} to approach the 'true' value Xk is governed 
by the ratio cr|iT/[2p(r — t)\. In particular, for fixed a and p this ratio takes a 
minimum value at a' — pa. When a' — pa, the linear equations ^t = atXx + (itT 
and ^( — a'tXx + Pit become closest to being degenerate, and hence the value of 
the additional information is minimised. 



6. Innovations and the dynamics of informed valuations 

Our objective now is to obtain an innovations representation for the valuations 
made by the informed trader. For this purpose it sufhces to derive the dynamical 
equation satisfied by the 'insider' valuation {i?tT}j or equivalently, by the condi- 
tional probability {t^u}- The calculation simplifies if we express (5.6) in the form 



BtT — PtT 



Er=i P^^^ exp 


_^ (cTx^t - Icr^xjt) 




Ei=i Pi exp 


^ (ax,6 - \crHlt) 





(6.1) 



Here the process 



it ^atXr + PtT, 



0-2 - 2paa' + a'- 



(6.2) 
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represents the 'enhanced' information being effectively received by the informed 
trader, with the modified bridge noise 



At- 



1-/9^ \ <J — pa' a' — pa 



Z J- PtT + Y l^tT 

1 — p^ 1 — p^ 



(6.3) 



2 - 2paa' + a' 
Applying Ito's lemma to (5.4) and making use of (6.2), we find that 

^ = ^(x,-X,)AZ,^ (6.4) 

where Xt = E[XT|5t]. The process {Zt] appearing in (6.4) is defined by 

Zt - Ct + / 7^ 6d,s - / -^ Xsds. (6.5) 

Jo T-s Jo T-s 

By following a line of argument similar to that presented in Brody et al. (2007) 
it can be shown that {Zt} is a standard t/j-Brownian motion, ft follows that the 
valuation process of the informed trader obeys the following dynamical equation: 

dBtT = rtBtT<^t + TtAZt, (6.6) 

where the volatility process {Tt} is given by 

r* = ^ PtrVtT, (6.7) 

and VtT denotes the variance of Xt conditional on Qt- 

The fact that the information {^t , ^i } accessible to the informed trader can be 
'compactified' into a single enhanced information {^t} can be understood as follows. 
Since the noise processes {/Str} and {P'tx} have correlation p, one can write 



f3'tT = pf3tT + V^-P^PtT, (6.8) 

where the Brownian bridge process {Ptx} is taken to be independent of {Ptx}- 
Similarly, we can decompose the extra information {^j} in the form 



e^ = p^t + Vl^L (6.9) 



where ^t — atX^ + PtT and a — (a' — pa)/ y^l — p^ . ft should be evident that 
the filtration generated jointly by {S,t,^'t\ is equivalent to that generated jointly by 
{^(,^t}. However, the information processes {^t} and {^t} have independent noises. 
To proceed we note that the process {5t] defined by 

^^^6„6^At_At ^g^^Q^ 

a a a a 

is purely noise, and is independent of X^. We can construct a new Brownian bridge 
that is independent of this noise. A standard orthogonalisation shows that this is 
given by the bridge process {Ptx} defined by (6.3). It follows that the enhanced 
information process {^t} defined by (6.2) is independent of {5t\. Furthermore, the 
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filtration generated jointly by {^tiCt} i^ equivalent to that generated jointly by 
{^t,St}- Since {St} provides no useful information about Xt, i-e. J{St, Xx) = 0, the 
informed trader in effect has {^t} as the primary basis for valuation. 

This line of argument, making use of the orthogonalisation procedure, extends 
more generally to the case where there are multiple information processes relating 
to the cash flow Xt'- Starting with a family of processes {Cf }fe=i....,n with signal-to- 
noisc ratios {(yk}k=i,...,n one orthogonalises the associated noises. The result is a new 
set of information processes {Cf }fe=i,....n with signal-to-noise ratios {d'k}k=i....,n- 
Then the information process that the informed trader uses as a basis for valuation 
can be represented by a single effective information process {^j} with the enhanced 
signal-to-noise ratio a — {^^ ^k)^^"^- 

7. Additional information held by the informed trader and 
statistical arbitrage strategies exploiting this 

We are in a position to quantify the amount of excess information held by the 
informed trader above that of the market. This is measured by the difference of the 
mutual information: 

AJ = J(6,Xt)-J(6,^t). (7.1) 

By the argument in §3, the mutual information of the informed trader is given by 
an entropy difference of the form 

J{£,uXt) = H(it) - HiitlXr). (7.2) 

The entropy H{^t) of the 'insider' information is determined by the marginal density 
of ^t, whereas the conditional entropy H{^t\XT) is the entropy of a Brownian bridge. 
Following the line of argument presented in §4 we are able to represent the 
mutual information difference in terms of the expected entropy differences: 

AJ^E[Ht]~nHt], (7.3) 

where Ht = — ^^ tth InTTii. This expression makes it apparent that A J is nonnega- 
tive, since the entropy characterises the amount of uncertainty concerning the value 
of the cash flow Xt, and for any t G (0,T) this uncertainty is greater on average 
for the general market participant than for the informed trader. In Figure 4 we plot 
an example of A J, indicating the way in which the excess information held by the 
informed trader changes in time. 

Given that the informed trader is on average 'more knowledgable' than the gen- 
eral market participant it is natural to ask how this advantage can be turned into 
profit. One of the issues that can be addressed in this connection is the derivation 
of optimal trading strategies. For such an analysis one may need to introduce addi- 
tional structure into the problem in the form of a suitable criterion for optimality 
and a specification of the market price of risk. In the present investigation, we con- 
fine the discussion to a demonstration, supported by simulation studies, of how even 
very simple strategies can yield statistical arbitrage opportunities by outperforming 
the market. 
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Figure 4. Mutual information difference. The additional information held by the informed 
trader over that of the market is nonnegative. The parameters are set to be hi = 0, /12 = 1, 
pi = 0.2, p2 = 0.8, T=l,a = 0.25, a' = 0.45, and p = 0.15. 



For example, suppose we consider a strategy such that at some designated time 
t £ [0, T] a market trader purchases a digital bond iff at that time the bond price 
BtT is greater than KPtx for some specified threshold K. The value of K can be 
regarded as the risk aversion level of the trader. An informed trader follows the same 
rule, but makes a better estimate for the value of the bond, and hence purchases 
the bond iff Bit > KPtT- In either case a bond that is purchased will be held until 
maturity. That such a strategy leads to a statistical arbitrage opportunity for the 
informed trader can be seen as follows. We assume that the initial position of the 
trader is zero, i.e. purchase of a digital bond at t requires borrowing the amount 
BtT at that time, and repaying the amount P^ Bit at T. Thus the value of the 
market trader's portfolio at T is 



Vt = t{BtT > KPtrHXr 



Ptr'BtT) 



whereas the terminal value of the informed trader's portfolio is 

Vt = t{BtT > KPtTJiXT ~ PtT^tT)- 



(7.4) 



(7.5) 



Consider now the present value PotIE[AVt] of a security that delivers a cash flow 
equal to the excess P&L AVr = Vt — Vt generated by the strategy of the informed 
trader. By use of the tower property we have IE[AVt] = ]E[E[AVT|5t]]; but 



E[AFT|ei] = P^r^ {\{BtT > KPtT} - HBtT > KPtT}) [BtT - BtT 



(7.6) 
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Figure 5. The P&L difference for digital bonds. At each time the traders purchase the 
bond if and only if the valuation of the bond is greater than a specified threshold. The 
general market trader buys if Btx > KPtr, whereas the informed trader uses the condition 
BtT > KPtT- The difference in profit and loss between the informed trader and the general 
market trader is plotted, based on 2000 realisations, when the a priori probability of 
default is pi = 0.2. Other parameters are set to be /ii = 0, /12 = 1, T = 5, a = 0.25, 
g' = 0.45, p = 0.15, and K = 0.7. 



since the random variables BtT and Bit are both ^j-measurable. If B^t > Btx 
then t{BtT > KPit} — Iji^tT > KPtx} is nonnegative, whereas if BtT < BtT 
then t{BtT > KPtT} - l{^tT > KPtT] is nonpositive. It follows that E[AVT|5t] 
is a nonnegative random variable, and hence E[AVt] > 0, since E[AVT|5t] > 
with probability greater than zero. Therefore, the informed trader can execute a 
transaction at zero cost that has positive value, and this is what we mean by 
'statistical arbitrage'. 

We have examined the profit and loss (P&L) profile, both for a general market 
trader and for an informed trader, resulting from the repeated application of such 
a strategy. The results are plotted in Figure 5. In particular, we consider 2000 
realisations (sample paths) for the information processes governing the bond price 
valuations. For each fixed t G [0, T] we calculate the total P&L for the informed 
trader and for the market trader obtained by following the specified strategy over 
and over for each of the 2000 independent sample paths. For each fixed t, we chart 
in Figure 5 the difference between the total P&L of the informed trader and that 
of the market trader. Providing that the strategy is executed after enough time 
has passed for the informed trader to gain an informational advantage, we find 
that the difference between the P&L of the informed trader and that of the market 
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trader is always positive. Furthermore, the quahtative behaviour of the resulting 
P&L difference is in agreement with the qualitative behaviour of the magnitude 
of the excess information possessed by the informed trader indicated in Figure 4 
(we have chosen the same parameter values for these two figures to allow for direct 
comparison). 

Our objective has been to demonstrate how statistical arbitrage strategies arise 
in a market characterised by heterogeneous information flow. It is interesting that 
a qualitatively similar behaviour for the excess information and the excess P&L 
is observed in the case of the rather primitive strategy we have considered here. 
There are many ways in which one can improve upon the trading strategy examined 
above. An important open issue is to determine the optimal trading strategy, subject 
to suitable optimality criteria, that exploits the excess information. We conclude 
by remarking that a related approach to the modelling of insider trading within 
the information-based framework is suggested in Macrina (2006), where the asset 
return is modelled as being dependent on more than one market factor, for which 
only some of the associated information processes are accessible to the market. It 
would be of interest to examine whether the kind of hedge fund strategy considered 
here is also applicable in a setup involving multiple market factors. 

We thank J. Z. Kelly, A. Macrina, B. K. Meister, and M. F. Parry for stimulating discus- 
sions. 
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